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Abstract. A Conditional Tree Pattern (CTP) expands an XML tree pattern with labels 
attached to the descendant edges. These labels can be XML element names or Boolean 
CTP’s. The meaning of a descendant edge labelled by A and ending in a node labelled by 
B is a path of child steps ending in a B node such that all intermediate nodes are A nodes. 
In effect this expresses the until B, A holds construction from temporal logic. 

This paper studies the containment problem for CTP. For tree patterns (TP), this 
problem is known to be CONP-complete. We show that it is PSPACE-complete for CTP. 
This increase in complexity is due to the fact that CTP is expressive enough to encode 
an unrestricted form of label negation: * \ a, meaning ’’any node except an o-node”. 
Containment of TP expanded with this type of negation is already PSPACE-hard. 

CTP is a positive, forward, first order fragment of Regular XPath. Unlike TP, CTP 
expanded with disjunction is not equivalent to unions of CTP’s. Like TP, CTP is a natural 
fragment to consider: CTP is closed under intersections and CTP with disjunction is equally 
expressive as positive existential first order logic expanded with the until operator. 


1. Introduction 

Tree Patterns, abbreviated as TP, are one of the most studied languages for XML documents 
and used in almost all aspects of XML data managment. Tree patterns are a natural language: 
over trees, unions of tree patterns are equally expressive as positive first order logic [3]. Also, 
like relational conjunctive queries, the semantics of TP’s can be given by embeddings from 
patterns to tree models [I]. Equivalence and containment of TP’s is decidable in PTime for 
several fragments [Tj, and coNP complete in general m- Tree patterns can be represented 
as trees, as in Figure [lj or be given a natural XPath-like syntax. 

In this paper, we study the expansion of tree patterns with the conditional descendant 
axis. We call this expansion Conditional Tree Patterns , abbreviated as CTP. Where the 
descendant axis in TP can be written as the transitive reflexive closure of the XPath step 
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Figure 1. The tree pattern corresponding to the XPath expression 
/p [t] //s [. //v] . The node in the square box denotes the output node. 



Figure 2. 


Conditional tree pattern corresponding to (1.1). 


child::*, the conditional descendant axis is the transitive closure of child :: p[Fi] [F 2 ] • • • [F n ], 
where n > 0 and each Fj is an XPath expression which might contain the conditional 
descendent axis itself, followed by a child step. Syntactically, the expansion is straightforward: 
in the tree representation add labels representing conditional tree patterns themselves to the 
edges. Figure [2] contains an example which is equivalent to the following XPath-like formula 
in which we use (•)* to denote the reflexive transitive closure of a path formula: 

self :: pfchild :: t]/(child :: r)*/child :: s 

[(child :: a[(child :: a)*/child :: b])*/child :: v] (1.1) 

The edge in Figure [2] labeled by r corresponds to the expression (child :: r)*/child :: *, and 
merely states that all intermediate nodes are labeled by r. The other labeled edge shows a 
nesting of patterns, corresponding to a nested transitive closure statement: all intermediate 
nodes have to be labeled by a and moreover have to start an a-labeled path ending in a 
6-node. 

Conditional tree patterns are the forward fragment of Conditional XPath m without 
disjunction and negation. The conditional descendant axis is closely related to the strict 
until operator from temporal logic mm- 
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Main results. Our main results concern the expressivity of CTP and the complexity of the 
containment problem. We consider two types of models: the standard XML trees in which 
each node has exactly one label, and trees in which nodes can have an arbitrary number 
of labels. These latter, called multi-labeled trees , are the models considered in temporal 
logic. All our results hold for both semantics. Models with multiple labels are a convenient 
logical abstraction for reasoning about tree patterns expanded with attribute value equalities. 
These are expressions of the form @ a = c, where a is an attribute, c a constant, meaning 
that it holds at a node if and only if the value of the a-attribute of the node equals c. (With 
that we can express XPath formulas like //table [©border = ’l’]). 

Containment of tree patterns has been studied in mmm- The most relevant results 
for this paper are that containment of TP’s is coNP-complete in general (151 and PSpace- 
complete when the domain of labels is finite [[IS] . We show that containment of CTP’s is 
PSPACE-complete (with both finite and infinite domain of labels). Interestingly, this increase 
in complexity is due to the fact that CTP is expressive enough to encode an unrestricted form 
of label negation: * \ a, meaning “any node except an a-node”. We show that containment 
of TP’s expanded with this form of negation is already PSPACE-hard. The matching upper 
bound for CTP containment is easily obtained by a translation into Existential CTL HD- 
As a contrast we consider the expansion of TP with a safe form of propositional negation 
n \ a , which selects nodes with the label containing n and not a, instead of * \ a [3- Note 
that this construct only makes sense on models with multiple labels. With respect to 
expressivity, we show that most results for TP can be generalized to CTP. CTP’s can be 
interpreted in trees by generalizing the TP-embeddings to simulations known from temporal 
logic. Similarly to the characterization for TP in (3], we show that CTP’s with disjunction 
and union are equally expressive as positive first order logic expanded with an until operator. 
From this we obtain that like TP’s, CTP’s are closed under taking intersections. 

Organization. The paper is organized as follows. This section is continued with a few 
more comparisons between TP and CTP, related work and a motivating example. Section [2] 
contains preliminaries. Section [3] contains the expressivity results and Section [4] the results 
on the complexity of the containment problem. We end with conclusions and open questions. 

Comparing logical properties of TP and CTP. A characteristic difference between TP 
and Relational Conjunctive Queries (CQ) is the disjunction property: if A \= B \/ C, then 
A |= B or A |= C. This holds for CQ, but not for TP. A counterexample is //p \= /p V /*//p- 
The languages TP and CTP differ on the following: 

Unions: TP expanded with disjunction is equally expressive as unions of TP [3]. However, 
CTP with disjunction is more expressive than unions of CTP. 

Countermodels: If containment between two TP's fails, there is a countermodel for it of 
polynomial size m ■ Countermodels for CTP containment may be exponential. 
Complexity: TP containment is coNP-complete [15] and CTP containment is PSpace- 
complete. For both languages this remains true if we add disjunction to the language 
(for TP see [16]). 

However, there are a few useful technical results that TP and CTP share. 

Monotonicity: TP and CTP formulas are preserved under extensions of models at the 
leaves (i.e. when the original model is a subtree of the extension). This means that if a 
TP (CTP) formula holds in a tree, then it holds in the extensions of the tree. 
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Multiple output nodes: The containment problem for TP and CTP formulas with multi¬ 
ple output nodes can be reduced to containment of Boolean TP and CTP formulas (i.e. 
when the formula has a single output node which is the root). 

Containment for unions: Containment for unions of TPs can be reduced in PTlME to 
checking a set of containments between TP formulas [15!]. This reduction can be seen as 
a weak disjunction property. A similar property holds for CTP (see Proposition 4.2). 
Multi-labeled models: There is a PTime reduction from the containment problem over 
trees in which each node has exactly one label to the containment problem over multi- 
labeled trees. This holds for both TP and CTP. 

For TP, most of these results are in m- Here we show how their proofs can be generalized 
to CTP. 


Motivation. Tree patterns exhibit a nice tradeoff between expressive power and the com¬ 
plexity of static analysis. However, there are natural scenarios where tree patterns are 
not powerful enough, e.g. in Example 1.1 The conditional axis gives us more querying 


capabilities while preserving some of the nice properties of tree patterns (see the comparison 
above). 


Example 1.1. Conditional tree patterns are used to query tree shaped structures. As an 
example, take the tree structure of the UNIX file system. In this file system, every hie and 
directory has different access permissions (read, write or execute) for different type of users 
(the user, the group and others). Thus, the hie system can be modeled as a tree where each 
node corresponds to a directory or a hie (labeled by ”dir” and ’’file” respectively) and has a 
required attribute for each pair (user, access right) which takes values from {0,1}. A hie 
can only be a leaf in the tree. 

Assume we want to ask for all the hies that are readable by the user. This means that 
we are looking for precisely those hies for which the following permissions hold: 

• the hie is readable for the user, 

• the directory in which the hie resides is both readable and executable for the user, 

• the same holds recursively for all the directories from the root to the hie. 

This query can be neatly expressed as a CTP path formula: 

/(child :: dir[@ (usei . jread) = 1] [@( useri execute) = l])7 child :: f ile [@(user,read) = l]- 

Additionally, CTP can express non-trivial constraints over the tree representing the hie 
system. For instance, the formula 

//self .. file[@ (other,read) l] t /(child .. dir]@( 0 ther,read) l][®(other,execute) l]) /* 

imposes the constraint that for all hies which are readable it holds that every directory on 
the path from the root to this hie must be both readable and executable by others. 


It is known that the conditional axis cannot be expressed by tree patterns or in Core 
XPath [13] . On the other hand, the queries from Example |1.1| can be expressed in the positive 
forward fragment of Regular XPath, which is more expressive than CTP. However, we 
believe this additional expressive power leads to increase in the complexity (of containment) 
than for conditional tree patterns. 
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1.1. Related Work. The complexity of tree patterns is studied in a number of papers 
and since [16] a virtually complete picture exists for the complexity of the containment 
problem for positive fragments of XPath. Miklau and Suciu m show that the complexity 
of the containment problem for TP with filters, wildcard and descendant is coNP-complete. 
Containment and equivalence for fragments of TP were studied before. The most interesting 
result is that containment for TP without the wildcard is in PTime [1J. 

Our conditional tree patterns are a first order fragment of conjunctive regular path 
queries. Calvanese et al. [6] show that the complexity of containment of these queries is 
ExpSPACE-complete, but these are interpreted on general graph models. 

Conditional XPath [13] and conditional tree patterns are closely related to branching 
time temporal logic CTL [8j. The conditional child axis and the strict until connective are 
inter definable. Kupferman and Vardi m show that the containment problem for 3CTL, 
which is the restriction of CTL to formulas having only the 3 path quantifier in front 
of them, is a PSPACE-complete problem. The positive 3CTL-fragment without until was 
also studied by Miklau and Suciu m- They show that the containment problem for this 
fragment is equivalent to the TP-containment problem and thus also coNP-complete. 

This paper studies the containment problem without presence of schema information. 
A number of complexity results for containment in TP w.r.t DTDs are given in [16] . In 
particular, containment for TP with Liters, the wildcard and descendent w.r.t DTD is 
ExpTlME-complete. This hardness result together with an ExpTime upper bound for 
Conditional XPath m, which contains CTP, gives us ExpTlME-completeness of containment 
for CTP in the presence of DTDs. 


2. Preliminaries 

We review the basic definitions of XML trees, Regular XPath and its semantics. Then we 
present Tree Patterns and Conditional Tree Patterns as sublanguages of Regular XPath. 
Tree patterns have an alternative semantics in terms of embeddings m- We give such an 
“embedding semantics” for Conditional Tree Patterns using “until-simulations” in Section [3] 

2.1. Trees. We work with node-labeled unranked finite trees, where the node labels are 
elements of an infinite set X. Formally, a tree over X is a tuple (N, E,r, p), where N, the 
set of nodes of the tree, is a prefix closed set of finite sequences of natural numbers, E = 
{((ni,..., 71^), {ni ,..., rik, nk + 1 }) | (ni,..., rik+ 1 ) E N} is the edge or child relation, r is 
the root of the tree, that is the empty sequence, and p is the function assigning to each node 
in A" a finite subset of X. We refer a tree over X just as a tree if X is clear from the context. 

Trees in which p(-) is always a singleton are called single-labeled, or XML trees. Trees 
without this restriction are called multi-labeled trees. 

We denote by E + the descendant relation, which is the transitive closure of the edge 
relation E, and by E* the reflexive and transitive closure of E, and by E(x) the set of all 
children of the node x. A node n is a leaf if E(n) is empty. A path from a node n to a 
node m is a sequence of nodes n = no,..., nj- = m, with k > 0, such that for each i < k, 
)_i E E(ni). We call a branch any maximal path starting from the root. If mE + n , m is 
called an ancestor of n, and if mEn, m is called the parent of n. 

If n is in N, by T.n we denote the subtree of T rooted at n. A pointed tree is a pair 
T, n where T is a tree and n is a node of T. The height of a pointed tree T, n is the length 
of the longest path in T.n. 


6 


A. FACCHINI, Y. HIRAI, M. MARX, AND E. SHERKHONOV 


Given two trees T\ = (N\, E\, r\, p{) and T 2 = (N 2 , E 2 , r 2 , P 2 ) such that N\ and N 2 are 
disjoint, we define the result of fusion of Tf and T 2 , denoted as Tf © T 2 , as the tree obtained 
by joining the trees T\ and T 2 without the roots under a new common root labeled by the 
union of the labels of the roots of T\ and T 2 . Formally, Tj © T 2 is the tree T = ( N , E. r, p ), 
where N = (N\ \ {ri}) U (N 2 \ {r 2 }) U {r}, E = (E x \ {(ri,n) | n G Ni}) U (E 2 \ {(r 2 ,n) | 
n G N 2 }) U {(r, n) | (ri,n) G E\ or (r 2 ,n) G E 2 } and 
pi(ri) U p 2 (r 2 ) if n = r, 

P{ n ) = Pi(n) if n G iVi \ {n}, 

P 2 {n) if n G N 2 \ {r 2 }. 


2.2. XPath and Tree Patterns. We define Tree Patterns and Conditional Tree Patterns 
as sublanguages of Regular XPath [El- 

Definition 2.1 (Forward Regular XPath). Let X be an infinite domain of labels. Forward 
Regular XPath, RXPath for short, consists of node formulas p and path formulas a which 
are defined by the following grammar 

p ::= p | T | -1 p | p V p | p A p \ (a)p 
a ::= f \ Ip \ a; a \ a U a \ a*, 

where p G X. 


We will use a + as an abbreviation of a; a*. 

For the semantics of RXPath, given a tree T = (N, E,r, p) over X, the relation \o\t E 
N x N for a path expression a and the satisfaction relation |= between pointed trees and 
node formulas are inductively defined as follows: 

• Mr = E, 

• PV]t = {(n, h)gJVxJV T,n\=p}, 

• [<*;/% = lajr 0 [/ 3 ] t , 

• [a U /3 Jt = Hr U [/3 ]t, 

• [a*lr = (Hr)*, 

and 


T, n 


• T, n |= p iff p G pin) , 

• T,n \= -<tp iff T, n p , 

• T, n |= p A if iff T, n |= p and T, n |= if, 

• T, n j= p V if iff T, n \= p or T, n \= ip, 

• T, n |= (a) p iff there is a node m with (n, m) G [air an d T,m \= 

Sometimes we will write T |= p to denote T, r |= p. If the latter 
a model of p. 


p. 

holds, we say that T is 


(Conditional) Tree Patterns. Tree patterns are the conjunctive fragment of RXPath without 
unions of paths and with a strongly restricted Kleene star operation. Node formulas 
correspond to Boolean tree patterns, and path formulas to tree patterns with one output 
node. 

In the following definitions we again let X be an infinite domain of labels. We define 
Tree Patterns by restricting the syntax of RXPath as follows: 
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Definition 2.2 (Tree Pattern). Tree Patterns (TP) consist of node formulas p and path 
formulas a defined by the following grammar: 

p ::= p | T | p A p \ (a)p 
a ::= | | Ip | ct; a | | + , 

where 

For example, the tree pattern from Figure [l] can be written as the path formula 
a =?(p A (|}f); l + ; ?(s A (i)v). Conditional Tree Patterns are also defined by restricting the 
syntax of RXPath, where we allow conditional descendant paths. 

Definition 2.3 (Conditional Tree Pattern). Conditional Tree Patterns (CTP) consist of 
node formulas p and path formulas defined by the following grammar: 

p ::= p | T | p A p \ (a)p 
a ::= i\lp\a\a \ (|;?4>*4 , 

where p G S. 

The tree from Figure [2] can be written as the path formula 

?(P A 4>f); U; ?r)*;|; ?(s A <(|; ?(o A <(|; ?a)*; 1)6))*; i)v). 

Note that the node formula ((|; Ip)*]^)^ is exactly the strict until 3U(ip, p) from branching 
time temporal logic CTL D31. We will abbreviate this formula simply as (p + )ip. Because 
of the equivalences (a; /3)p = ( a)(/3)p and ( r ?p)ip = p A V’, CTP node formulas can be given 
the following equivalent definition: 

p ::= p\T \p/\p\ (\)p\ (\+)p\ (p + )p. 

Even though the syntax is slightly different, CTP is the conjunctive forward only fragment 
of Conditional XPath |iI3|. 

Expansions. We consider two expansions of TP and CTP, with negated labels and with 
disjunction in node formulas together with union in paths. Negated labels is a restricted 
type of negation where only the construct —>p is allowed in the node formulas. We denote 
expansions of the language L with one or two of these features by L s for S C {—■, N/}. 

Query evaluation. In |9j it was shown that the query evaluation problem for Core XPath 
is PTiME-complete in the combined complexity. As noted in m , using results on model 
checking for Propositional Dynamic Logic, this can be extended to Regular XPath, and thus 
to all our defined fragments. 

Fact 2.4 (H3]). Let T be a tree, n\,n 2 nodes in T, and a a Regular XPath path formula. 
The problem whether (ni,n 2 ) G is decidable in time 0(|T| x |a|) with \T\ the size of 
the tree and lal the size of the formula. 
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2.3. Containment. As we are considering two kinds of expressions, path and node expres¬ 
sions, we have different notions of containment. 


Definition 2.5. Let ip and ip be two RXPath-node formulas. We say that p is contained 
in ip, notation p C ip, if for every T, n, T, n |= p implies T,n \= ip. Let a and j3 be two 
RXPath-path formulas. We say that 

• a is contained as a binary query in /3, denoted a C 2 f3, if for any tree T, [[ajr Q |[/3]]t, 

• a is contained as a unary query in (3, denoted a C 1 /?, if for any tree T with root r, and 
any node n, (r, n) E [a]]T implies (r, n) E [/3 |t ■ 

Containment over single-labeled trees is denoted by C, and containment over multi-labeled 
models by C M |_. 


Luckily, these three notions are closely related, and containment of path formulas can 
be reduced to containment of node formulas and vice versa (cf. also [16j |T5j ). 

Proposition 2.6. Let a and (3 be two RXPath -path formulas, p andip RXPath -node formulas, 
in which negation is restricted to labels only. 

(i) a C 2 f3 iff a C 1 j3, 

(ii) Let p be a label not occurring in a or in p3. Then, a C 2 (3 iff (a)(\.)p C (/ 3)(\.)p , 

(hi) pQipiff^P <^ 2 3ip. 

All the above items also hold for the case of multi-labeled trees. 


Proof, (i) (=>) Let T = ( N,E,r,p ) be a tree such that (r, n) E [[a]]T- By the assumption, 
we have [oJt U \(3\t- This implies that (r, n) E [[/?]]y, which was required to show. 

(<£=) Let T = ( N,E,r,p ) be a tree, n\ and ri 2 in N such that (ni,n 2 ) E [[a]]T- Since a 
is from the forward fragment of Regular XPath, we have that (ni, 112 ) E |a]]T.ni- Using the 
assumption, we have (ni,ri 2 ) E [[/3]]T.ni- Then by monotonicity, we obtain (ni,ri 2 ) E [[/3]t, 
which was desired. 

(ii) (=>) Suppose a C 2 (3, and let T, n |= (a)(l)p. Thus there is a descendant m of 
n in T such that (n,m) E [ajr and T, m |= (i)p. By the hypothesis we therefore have 
( n,m ) E 1(3}t and thus T,n\= {/3)(i)p. 

(<^=) Assume (a)(\.)p C (/3)(|)p• Consider a tree T = ( N,E,r,p ) and a pair (n,m) E 
[[ajT- Since p does not occur in a, we may assume that T,n\f= p for every node n E N. We 
then define the tree T' = (N ', E ', r, p'), where 

• N' :=NU{x}, 

• E' := E' U {(m, x)}, 

p if y = x, 

p(y ) otherwise. 

By definition of T', we have T',m \= (l)p and (n,m) E [oIt'- Hence, T',n |= (a)(l)p. 
By the assumption, it implies that T ', n \= (/3)(\.)p. Since p holds at x only, and x is a child 
of m, we have that (n,m) E [[/3 ]]t'- By definition of T' , the path between n and m is also in 
T. Thus, (n, m) E [[/3]]t) as desired. 

Item (iii) easily follows from the definitions. □ 


p'(y) ■= 


In light of Proposition 2.6, from now on we consider the containment problem of node 


formulas only. We now give an interesting example of CTP containment. 
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Example 2.7. Let us consider the CTP node formulas 

<p = (((b + )d) + )(a + )b and 

^ = <(( c + )d) + ){a + )b. 

Although it is hard to see from the first glance, <p C ip holds. Indeed, in every model T of ip 
either there is a direct child of the root where ( a + )b holds, or there is a path r = vi,...,v n 
in T to the node v n where ( a + )b holds and (b + )d holds in every node Vi (1 < % < n) on the 
path. In the first case, ip holds at the root since ( a + )b holds at the direct child. 

Now let’s consider the second case. Let j £ {2,..., n — 1} be the least number with the 
property that Vj has a non-empty 6-path to the d-descendant. If there is no such number, 
then (c + )d holds at every Vi, 1 < i < n (as each of them has a direct d-child) and, thus, 
T,r \= ip. 

Assume there is such a j. Since Vj has a 6-node as a child, we have that T, Vj |= (a + )6. 
Moreover, (c + )d holds at each Vi for i < j since Vi has a d-node as a child. Thus in this case 
we obtain that T,r \= ip holds too. 


3. Expressivity 

We extend the semantics of TP given by embeddings of queries into trees to CTP. Instead 
of embeddings we need a simulation known from temporal logic. 

3.1. Interpreting Conditional Tree Patterns by simulations. The semantics of condi¬ 
tional tree patterns can be defined using simulations developed for LTL [5]. These simulations 
generalize the embeddings for tree patterns from in esi with an additional clause for checking 
the labels on the edges. 

We start with defining the tree pattern analogues of CTP node and path expressions. 

Definition 3.1. A conditional tree pattern is a finite tree (N, E,r,d, pisr, pe) with labeled 
nodes and edges, where N is the set of nodes of the tree, E C N x N is the set of edges, r 
is the root of the tree, d is a fc-tuple (k > 0) of output nodes, pn is the function assigning to 
each node in N a finite set of labels from £ and pe is the function assigning to each pair in 
E either or a Boolean conditional tree pattern. 

A Boolean conditional tree pattern is a conditional tree pattern with a single output node 
which equals to the root. 

A conditional tree patterns is said to have multiple output nodes if the number k = |o| is 
greater than 1. 

A tree pattern is a conditional tree pattern whose edges are only labeled by or T. 

To be consistent with the pictorial representation of TP, in CTP patterns an edge labeled 
with 4- is drawn as a single line, while an edge labeled with a CTP node formula is drawn as 
a double line with a CTP pattern as the label (e.g. as in Figure [2]). The output nodes have 
the square shape. 

CTP node and path expressions can be translated into (Boolean) conditional tree patterns 
with one output node and vice-versa. The translations are given in Appendix |A| We denote 
the equivalent (Boolean) conditional tree pattern of a CTP path or node expression a or p 
by c(a) and c((p), respectively. 

Next we generalize the notion of TP-embeddings to CTP-simulations. 
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Definition 3.2. Let T = (N, E, r, d, pN, Pe) be a conditional tree pattern as in the previous 
definition and T' = (N', E' ,r', p') a tree. A total function / : N —> N' is called a simulation 
from the pattern T into the pointed tree T' , r' if it satisfies the following properties: 

root preserving: f(r) = r' \ 

label preserving: if p £ pj\r{n), then p £ p' (f(n))-, 
child edge preserving: if nEn' and pE(n,n') — \!, then f (n)E f (n')\ 
conditional edge simulation: if nEn' and pE(n,n') is not equal to 4-, then f(n)E' + f(n') 
and for every x such that f(n)E' + xE' + f(n') there is a simulation from the Boolean 
conditional tree pattern pE(n,n') into T'.x (the subtree of T' rooted at x). 

When the pattern is a tree pattern, f and T are the only labels on edges. For the label 
T, the ’conditional edge simulation’ clause trivializes to checking that / is an embedding 
for the descendant edges. Simulations for tree patterns are thus equivalent to tree pattern 
embeddings mm- 

The next theorem states that simulations can be used to evaluate conditional tree 
patterns. 


Theorem 3.3. Let p and a be a CTP node and path expression, respectively. Let T be a 
tree and n a node in T. 

(i) T,n \= p if and only if there is a simulation from c (p) into T.n. 

(ii) (n, n') £ [ajr if and only if there is a simulation from c(a) into T.n which relates the 
output node of c(a) to node n!. 

(iii) Items (i) and (ii) also hold when T is an infinite tree. 


The proof is by mutual induction on the node and path expressions. 

Remark 3.4. The notions of (conditional) tree pattern and embedding for tree patterns 
and simulation for conditional tree patterns are easily extended to work for expansions of 
these languages with negated labels (denoted by TP^ and CTP'’, respectively). 

For (conditional) tree patterns, add a second node labelling function p])(-) which also 
assigns each node a finite set of labels. These are interpreted as the labels which are false at 
the node. 


For the embeddings and simulations, in Definition 3.2 we add a clause stating that also 
negated labels are preserved: 

negated label preserving: If p £ p^(n), then p 0 p'(f(n)). 

With these modifications, Theorem |3.3| also holds for TP^ and CTPT 


3.2. Expressivity characterization. In this section we give expressivity characterization 
for CTP similar to the one for various fragments of XPath in [3]. The exact logical 
characterization allows one to compare different fragments of (Regular) XPath and derive 
non-trivial closure properties such as closure under intersection. We show that CTP path 
formulas correspond to a natural fragment of first order logic (FO) and are closed under 
intersections. 

For ip a formula, let desc^(x, y) be an abbreviation of the ’’until” formula desc(x,y) A 
\/z(desc(x, z) A desc(z,y) -A- p(z)). Let 3 + (child, desc^) be the fragment of first order logic 
built up from the binary relations child and desc , label predicates p(x) for each label p £ X 
and equality , by closing under A, V and 3 as well as under the rule: 

if <p(x) £ 3 + (child, descp), then desc^(x,y) £ 3 + (child, desc^). 


(3.1) 
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We use 3 + (child, desc^)(c, s) to denote 3 + (child, desc v ) formulas with exactly two vari¬ 
ables c and s free. Following [3] we restrict the FO fragment to its downward fragment 
3 + (child, desc v )[down](c, s ) by requiring that every bound variable as well as s is syntacti¬ 
cally restricted to be a descendant of c or equal to c. 

Note that 3 + (child, desc^)[down\(c,s) without the rule (3.1) is the logic 
3 + (child, desc)[down\(c, s) from [3j, and shown there to be equivalent to unions of tree 
patterns. Even though 3 + (child, desc v )[down](c, s) does not contain negation, not every 
formula is satisfiable (e.g. desc(x,x)). 

We can now characterize conditional tree patterns in terms of 3 + (child, desc v )[down\(c, s). 


Theorem 3.5. The following languages are equivalent in expressive power: 

• unions of the false symbol and CTP paths which can have disjunctions in the node formulas 

• 3 + (child, desCp)[dowii](c, s). 

Proof. A standard translation TR xy ( •) turns any CTP path formula a into an equivalent 
formula in 3 + (child, desCu,)[down](c, s). The translation is essentially the semantics of path 
and node formulas written in the first order language. 


TR xy (H )) 

S3 

II 

< 

^3 

<0 

II 

TR xy (l ) 

= child(x,y) 

TR xy (lp) 

= x = yATR x (<p ) 

TR X y(oi li Oi 2 ) 

= 3z.(TR xz (a\) A TR zy (a 2 )), 
where z is a fresh variable 

TR xy ( (4.; ?*>)*; 40 

= desc TRz ^)(x,y ) 

TR xy (ai U a 2 ) 

= TRxy(ai) V TR xy (a 2 ) 

TR x (p) 

= p(x) 

TR X ( T) 

= X = X 

TR x ((p 1 A tp 2 ) 

= TR x ((pi) A TR x (ip 2 ) 

TR x (ip 1 V (p 2 ) 

= TR x (p>i) V TR x (<p 2 ) 

TR x ((a)p>) 

= 3y.(TR xy (a) A TR y (<p)), 
where y is a fresh variable. 


By definition, TR cs (a ) is a formula in 3 + (child, desCy,)[down](c, s) and equivalent to a. 

For the other direction, let 6 S 3 + (child, desc v )[down](c, s). We use our earlier notation 
desc^(x,y) for the ’’until” formulas. First we introduce two special formulas T(x) and _L(.t) 
which stand for x = x and desc(x,x ), respectively. 

We will modify 6 in several steps. First we replace child(x, y ) and desc(x, y) by the 
equivalent desc±(x,y) and descj(x,y), respectively. Then eliminate all equalities by renam¬ 
ing variables. Bring all existential quantifiers inside 9 to the front, bring the body into 
disjunctive normal form and distribute the disjunctions over the quantifiers. We then end 
up with a disjunction of formulas of the form 3xip(c, s ), with (p a conjunction of formulas 
desc x j 1 (x, y), p(x), T(x) and _L(x). If ip contains _L replace 3xip(c,s) with _L. 

As our target language is closed under unions and _L, we only need to translate the 
’’conjunctive queries” 3x<p(c,s). Consider the graph of the variables in p in which two 
variables x, y are related if <p contains an atom desc^(x, y). If the graph is cyclic, it cannot 
be satisfied on a tree and 3xip(c, s ) is equivalent to _L. If it is a tree, c will be the root. 
Assuming that we can rewrite all if in the atoms desc^,(x, y), we can rewrite it as a conditional 
tree pattern. If if is a boolean combination of p(z), T(z) and _L(z) atoms, this is not hard: 
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Simply bring it into disjunctive normal form, and remove each conjunct containing _L. If 
the result is not empty, then translate to a union of (trivial) conditional tree patterns. If 
it is empty, translate desc^,(x,y) to a child step. If ip contains subformulas of the form 
desc^{x,y), we apply the current procedure to it. 

Thus assume that the variable graph is a directed acyclic graph. We eliminate undirected 
cycles step by step. The length of the cycle equals the number of variables in it. Consider 
that the graph contains two paths and 7T2 both going from x to y and without other 
common variables. We show that this subgraph is equivalent to a union of subgraphs with 
no or smaller cycles. 

In the simplest case, both paths are of length 1 and thus consist of a desc^(x,y) atom. 
But then we can use the following equivalence to remove the cycle: 

desc tp (x, y) A desc^(x, y) = desc^ A ^(x, y) ( 3 . 2 ) 

So assume the cycle looks like Figure [3j 



CC 


*2 


7T 2 


Figure 3. undirected cycle in ip 


If this is satisfied on a tree (IV, E, r, p) with assignment g, there are three possibilities: 
g(zi) = gfa), g(z±)E + gfa) or g{z 2 )E + g{z\). In each case our original formula is equivalent 
to one with a smaller cycle. The three possibilities are 
when g(zi) = g(z 2 ): desc^(x, zi) A z^y A zynf 2 y 

when g(zi)E + g(z 2 ): desc^^x, z{) A ip{z\) A ziir[y A desc^(zi, z 2 ) A Z2iT 2 y 
when g(z 2 )E + g(z\): desc ipA ^(x, z 2 ) A 92(22) A Z2ir' 2 y A desc ip (z 2 , z\) A 2i7rjy. 

In the first case the length of the cycle decreased by two, in the two other cases by one. 

Thus their disjunction is equivalent to the formula of Figure [3j We replace that formula 
by this disjunction, bring the result in disjunctive normal form and distribute the disjuncts 
out. We again have a disjunction of ’’conjunctive queries”. As the new cycles are smaller, 
this procedure will terminate, and results in a (big) disjunction of trees. M 

An important consequence of this result is that CTP patterns are closed under intersec¬ 
tion: 


Theorem 3.6. The intersection of two CTP paths is equivalent to _L or a union of CTP 
paths. 


Theorem 3.5 together with the translation of Regular XPath into FO*(c, s ) from m 
implies that every union of CTP patterns with disjunctions is in the intersection of first 
order logic and positive downward FO*(c, s). It is an intriguing open problem whether the 
converse also holds. 
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Figure 4. 


Patterns <pf and ip from Proposition 


4.2 


4. Containment 

Before we determine the exact complexity of the containment problem for CTP and expansions 
we prove a number of reductions. Most are generalizations from TP to CTP. The key new 
result is the encoding of negated labels in CTP. 


4.1. Containment Preliminaries. The following reductions will be used later in our upper 
and lower bound proofs. 


Multiple output nodes. The main difference between tree patterns and their XPath for¬ 
mulation is that tree patterns can have multiple output nodes. Kimelfeld and Sagiv [TO] 
(Proposition 5.2) show that for the TP containment problem the number of output nodes is 
not important: the problem can be PTime reduced to a containment problem of Boolean 
TPs. This is achieved, given tp, ip 6 TP, by adding a child labeled with a new label Oj to 
every output node in both p and ip. Second, to every leaf that is not an output node 
or a newly added node, we add a child labeled with T. The same result, using the same 
argument, holds for CTP. 

Proposition 4.1. Let S C {V, -i}. For CTP 5 patterns with multiple output nodes p, ip there 
are PTime computable Boolean CTP S , pfip 1 such that <p C ip iff p' C ip' . The same holds 
for multi-labeled trees. 


Disjunctions in the consequent. We now show that the containment problem for both unions 
of CTP^ and unions of TP^ can be reduced to containments without unions. This is useful in 
lower bound proofs, as we can use a union in the consequent to express multiple constraints. 
The proof of the following proposition is a slight modification of the proof for TP in [15] . 


Proposition 4.2. Let p be a CTP^ (TP^J formula and A a finite set of CTP^ (resp. TP^J 
formulas. Then there are PTime computable CTP^ fT PM formulas p' and ip such that 
<P Q V A iff <p' C ip. 
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Figure 5. Model for pattern <p' from Proposition 4.2 


If p and A are in CTP (TP), ip' and if are in CTP (TP) as well. 

The same holds for containment over multi-labeled trees. 

Proof. First, for the case of TP", the proof from [15] can be readily applied here (using 
embeddings which also preserve negated labels, cf Remark 3.4). 


Now we prove the proposition for CTP". For simplicity, we use the same letter to denote 
a CTP" formula and its corresponding tree pattern representation. Assume A is {5i,..., <5^}. 
If t is a CTP" pattern, by i a we denote the CTP" pattern defined as having the root labeled 
by a, whose unique child is the root of i. Consider new labels a and b occurring neither in ip 
nor A. Let 7 = Ao<^<fc Note that 7 is consistent if every 5# is consistent. Now define <p' 
as the CTP" tree pattern in Figure |4](a), and if as the CTP" tree pattern in Figure [4](b). 

We verify that <p C \J A iff ip' C if. Without loss of generality, we assume that each of 
<p,5i,... ,5k is consistent. 

For the direction from left to right, assume ip C \J A and T, n (= ip' for an arbitrary T, n. 

This means that there is a path n = mo. m 2 k-i of nodes in T which are labeled with a 

and such that: 

• for every 0 < £ < k: T, m t |= (5\) a , with i e {1,..., k — 1} U {k + 1__ 2k — 1}, 

• T,m k \= ( p b ) a . 

By the assumption, it follows that there is an index j such that T, rn k \= (5j) a . Then, by 
Theorem 3.3 there is a simulation f from ( dj) a into T.m k . The simulation f can be extended 
to a simulation / from if into T, n in the obvious way such that / is a simulation from (5\) a 
into T,m k -j+i and the a-descendant edge in if is simulated on the path mo,.. .m k -j+\- 
Thus, T, n |= if. 

For the other direction, assume tp' C if and T,n \= ip for an arbitrary T rooted at n. As 
all 5 1 are consistent, there are trees Tt, n£ \= 6(, for each I e {1,..., k}. Let T' be the tree 
created from T.n and the T (s depicted in Figure [ 5 ] with root r. Then T',r \= ip'. Because 
we assumed <p' C if, then also T ', r |= if, which by Theorem |3.3| implies the existence of a 
simulation g from if into T'.r. In particular, the span of k a-nodes in if has to be simulated 
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on an a-path of length k in T' . This means that for some j G {1,..., k}, g is a simulation 


from 5j into T.n. 


By Theorem 3.3 again, T,n \= 5j, and thus T,n \=\J A. 


Note that if p and A are in CTP, then the constructed p' and if are in CTP too. □ 


From single-labeled to multi-labeled trees. We now show that the containment problem over 
single-labeled trees can be reduced to a containment problem over multi-labeled trees. 

Proposition 4.3. Let S C {^,V}. Given CTP S (TP S ) patterns p , if, there are PTime 
computable CTP 5 (TP 5 ,) patterns p',iff such that p C if iff p' C ML iff. 


Proof. Let pi,.. ■ ,p n be the labels from E occurring in p or if. We take p" as p, and 
if" := if V V \<i < j< n (Pi A Pj) v Vi<i<j<n4^)(ft A Pj )• The disjuncts added to if ensure that 
every node of a counterexample is labeled with at most one label. Now we show that p C if 
iff p" C ML if" 

(=>) We show the contraposition. Let T be a multi-labeled tree such that T (= p" 
and T \f=- if". Then let T' be the tree obtained from T by restricting the labels p'(v) := 
p{v ) Cl {pi,... , Pn\ for every node v. Since T does not satisfy if", the label of every node 
in T' contains at most one symbol. Because we did not change the valuation of the labels 
pi,... ,p n , for each formula 6 constructed from these tags, it holds that T,v \= 9 iff T'. v |= 9. 
From this and the fact that the label of each node of T' is of size at most one, it follows that 
T' |= p" and T' | if". In order to make a single-labeled tree out of T 1 , we add a dummy 
symbol q in the label of a node if its label in T' is empty. For the resulting tree T" it holds 
T" |= p and T" ^ if. 

(<=) Let T be a single-labeled tree such that T (= p and T if. Then this tree can be 
considered as a multi-labeled tree. Moreover, T |= p" and T if" since T does not satisfy 
any of the disjuncts in if". 

Our transformation does not introduce negations, but only disjunctions in if". Thus 
the proposition is proven for the cases when S contains disjunction, where we take p' := p" 
and iff := if ". F or the remaining cases (S C {->}), although if" contains disjunctions, by 
there exist two PTime computable CTP‘ S (TP 5 ) patterns p' and iff such 


4.2 


Proposition 

that p" C M l if" iff p' A ml iff ■ This concludes the proof. 


□ 


Negated labels. We now show that CTP is expressive enough to encode label negation, as far 
as the containment problem is concerned. 

The trick of the encoding lies in the fact that two nodes connected by a descendent edge 
are in either child or descendant of a child relation. Additionally, if we require that neither 
of these relations occur at the same time, we can faithfully encode label negation. 

Proposition 4.4. Let p and if be CTP^ patterns. There are PTime computable CTP 
patterns p' and iff such that 

P C if iff p' C iff. 

This also holds for containment over multi-labeled trees. 

Proof. Given a containment problem p C if in CTP^, we construct an equivalent containment 
problem p* C if° with p* in CTP and if° a union of CTP patterns. Applying Proposition |4.2| 
then yields the desired result. 
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Let p\ , -ipi,... ,p n , —'Pn be the labels appearing in p and i/’ and their negations. Let 
s be a new label and £ be the formula 4)4 A f\ i (i + )pi)- We define the translation (•)* 
inductively: 

T‘ = C 

{piY = C a 4)(s a 4 )pi) 
hPi) 9 = C a 4)( s A 4)4 + >Pi) 

(9 A a)' = 6 * A a* 

(4M* = C A 4)4* 

{(p + W = CA{p- + )r. 

The translation is defined in such a way that for any p, p* implies (. Obviously (•)* is a 
PTime translation. Let 44 V 3 denote p V Let AX be the disjunction of the following 

formulas: 


(4)4 a (4)0 

(4.1) 

V4*>(KaH04 

i 

(4.2) 


(4.3) 




In our intended models, none of these disjuncts are true at the root. The disjunct 
(4.1) is technical, (4.2) states that no node makes the encodings of both p and —■ p true, 
and (4.3) ensures that no node makes the encoding of two different labels true. We define 
ip° = ijj* v AX. In case of multi-labeled trees the translation is the same, except that we do 
not include the disjunction (4.3) in the definition of i/j°. Note that although in ip° we have 
disjunctions within the scope of a modality, we can (in PTime) rewrite the formula into an 
equivalent union of disjunction free formulas. 

We now show that p C ip iff p* C -0°. 

(<£=) We show the contrapositive. Given a single-labeled tree T = (N, E, r, l ) such that 
T, r 1= p and T,r \/= 4 we build a single-labeled tree T l = (IV 7 , E l , r', l') as follows. The set 
of nodes N r is the maximal L^-connected subset of N x {0,1,2, 3} x {pi,... ,p n , 0} which 
contains the root r' = (r, 0, ft). The relation E' is defined as follows: (n 1 , 0, jj) is the parent 
of (n, 0, ft) when n' is the parent of n in T; (n, 0, jj) is the parent of (n, 1, jj); (n, 1, jj) is the 
parent of (n,2,pi),i = 1 ,...,n, and (n,2,pi) is the parent of (n,3,pi),i = 1 ,...,n. The 
labeling l' is defined as follows: s only labels nodes of the form (n, 1, jj). Then, pt labels 
(n, 2, pi) if T, n \= pt and pi labels (n, 3, pi) if T, n \/= Pi- No other nodes are labeled by pi. 
In all other cases, we label a node by a fresh label z. 

It is clear from the definition that T' is a single-labeled tree. Note that T' (A AX and 
that ( is true at all nodes of type (n, 0, jj), and only at these nodes. Let 9 be a formula in 
variables {pi,... ,p n }, and n £ N. By induction on 9, we show T, n (= 9 iff T', ( n , 0, jj) \= 9*. 

• 9 = T. This holds because by construction £ is true at all nodes of type (n, 0, jj). 

• 9 = p. We have Tpi \= p l(n ) = p l'{n , 2 ,p) = p 44>(since (n, 2 ,p) is the only child of 
(n, 1 , jj) labeled with p and (n, 1 , jj) is labeled by s ) 

T' (n, 1, jj) \= s A (l)p (since (n, 1, jj) is the only child of (n, 0, jj) and C is always true at 
(n,0,jj)) T', (n,0,jj) |= C A (|)(s A (|)p) T', (n,0,jj) j=p*. 
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• 9 = -i p. We have T, n \= -ip 4A Z(to) ^ p 4A l'(n,3,p) = p (since (to, 3,p) is the only 

descendent of (to, 1 , ft) labeled with p and (to, 1 ,( 1 ) is labeled by s) 

T 7 , (to, 1, )J) |= s A (|}(| + )p 4A (since (to, 1, JJ) is the only child of (to, 0, fj) and ( is always 
true at (to, 0, (()) T 7 , (to, 0, (t) |= C A (|)(s A (|)(| + )p) T\ (to, 0, (j) |= (~>p) 9 . 

• 0 = ipi A ip 2 - We have T, to |= p>\ A <^2 T, n \= p>\ and T, to (= 992 (by the induc¬ 
tion hypothesis) T 7 , (to, 0,j() |= ip\ and 1T / , (to, 0, J() |= </?* ^ X", (to, 0, (}) (= A ^ ^ 

T',(n, 0, (J) |= (^iA^)V 

• 9 = We only show the right to left direction. T 7 , (to, 0, (j) |= ((l)^)* 4A T 7 , (to, 0, (() |= 

C A (i)</?* <tA 3m E IV' such that (to, 0, K)E ,/ rro. and T 7 , m |= </?*. But then T 7 , rro |= £ because 
tp 9 implies £. Thus m must be of the form (to 7 , 0, (J) as only these make £ true. But then 
to 7 E N and nEn' and we may apply the inductive hypothesis to get T, n! |= ip, and thus 
T,to |= 

• 9 = {(p + )'ij>. We only show the right to left direction. T 7 , (to, 0, (J) (= ((ip + )'ip) 9 
T 7 , (to, 0, (t) |= C A (<p 9+ )'i/j 9 4A 3m E IV 7 such that (to, 0, K).E 7 + 7 n and T 7 ,m (= i/ ; * and 
VW.(to, 0, §)E ,+ m'E ,+ m it holds T' ,m! (= 99 *. Now, because <£>* implies £ for all m and 
all nodes m! are of the form (to 7 , 0, (j) and thus all in the original model N. Moreover they 
stand in the same way in the E relation. Thus we can apply the inductive hypothesis and 
obtain T, to |= (ip + )il>. 

As a special case, T 7 , (r, 0,))) satisfies y?* but not ij) 9 . Recall that T 7 , (r, 0, |) does not satisfy 
the other disjuncts AX of ip° either and thus, T 7 , (r, 0, (J) (= ip* and T 7 , (r, 0, (J) ■ 0 °, as 

desired. 

(=>) Again we show the contrapositive. Suppose there is a model T = (N, E, r, l ) 
satisfying 99 * but not ijj° at the root r. Then in particular T, r \/= AX. Without loss of 
generality, we can assume that the simulation from (the conditional tree pattern corresponding 
to) ip* into T is surjective. (Otherwise the image of ip 9 is a subtree of T and thus by 
monotonicity ip° cannot be satisfied at the root of this subtree.) In this model, as a 
consequence of the fact that ip 9 implies £, every branch has an initial segment satisfying £, 
immediately followed by a node labeled by s, which is followed by a segment where £ is 
never satisfied because of the first disjunct (4.1) of AX. 

We define a tree T' = {N ', E ', r', l') whose set of nodes N' consists of the nodes of T 
where £ is satisfied, i.e. N' = {to E N | T, to |= £}; E' is simply the restriction of E to N' 
and r 7 = r. We define Z 7 (n) = pi iff T, n |= p 9 , and if T, to \/= p 9 for any i, then we label 
to with a fresh variable z. This definition of Z 7 is well-defined because the falsity of the 
disjunction (4.3) in AX ensures that each node makes at most one p 9 true in T. 

Let 9 be a formula in variables {p\.... ,p n }, and to E N. By induction on 6 , we show 

that 


T,n\=9 9 iff T 7 , to |= 9. (4.4) 

• 9 = T. Then T, to |= T* 4A (by definition of (•)*) T, to |= £ 4A (by definition of N') 

n«NT. 

• 9 = p. Then T, to |= p 9 4A (by definition of the labeling Z 7 ) Z 7 (n) = p T 7 , n \= p. 

• 9 = -1 p. If T, to |= ( _, p)*, then by the falsity of AX, T, n Y= p 9 , and thus l'{n) 7 ^ p and 
T 7 ,to 1 /= p and thus T',n |= -ip. Conversely, T',n |= -1 p 4A Z'(to) 7 - p 4A T,n ^p’. But 
T, to |= £ and thus either p 9 or (-ip)* must hold at T, to. Thus T, to |= (-ip)V 

• 9 = ipi A (p 2 • Then T,n |= (<pi A P 2 )* T, n |= and T, to (= </?* (to E A ^ 7 because 
</ 9 * implies C and thus by the inductive hypothesis) T',n |= p>\ and T',n (= p >2 <=> 
T’,n \= ipi A ip 2 - 
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• 9 = (l}tp. Then T, to |= ((!)<£>)* 44 T, n \= ( and T, to |= (|}</?* 44 there exists to ' E N 
such that nEn! and T, n' |= (p* 44 (by the fact T, n! |= ( and, thus, n' E N' and the 
inductive hypothesis) there exists n' E N' such that nE'n' and T',n' j= <p 44 T',n |= (j,)<^, 
as desired. 

• 6 = ( 9 + )t. Assume T,n |= ((0 + )t)V Then, by definition of (•)*, T, to |= C A (# ,+ )r*. 
That means there exists n' in T with nE + n' such that T, n' (= t* and for all n" with 
nE + n"E + n' it holds T, to" |= 6*. By definition of (•)*, the nodes to, to' and all the nodes 
between n and n' satisfy ( and thus belong to T'. By inductive hypothesis, T',n' (= r 
and T',to" |= 9 for all nE + n"E + n' , which means T',n |= ( 9 + )t holds. Conversely, assume 
T',n |= ( 9 + )t. By definition of T 1 , we have that T,n (= which is the first conjunct of 
(( 9 + )t )*. The second conjunct follows by the inductive hypothesis. 

As T is a counterexample for <p* C -0°, (4.4) implies that T' is a counterexample of tp C ip, 
as desired. 

The same argument applies for multi-labeled trees. The only change is to remove the 
last disjunction (4.3) from AX. □ 


4.2. Lower bounds. In this section we show that the containment for TP is PSPACE 
hard. This lower bound will carry over to the containment problem for CTP. 


Theorem 4.5. 

(i) The containment problem for CTP is PSPACE-hard. 

(ii) The containment problem for TP -1 is PSPACE-Ziard, 

(iii) Both results also hold for multi-labeled trees. 


Proof, (i) follows from (ii) by Proposition |4.4[ (iii) follows from (i) and (ii) by Proposition [T3j 
For proving (ii), we reduce the corridor tiling problem, which is known to be hard for 
pspace [THU], to the containment problem for TPT We use the construction from the 
PSPACE-hardness proof for the containment problem of TP with disjunction over a finite 
alphabet in [16]. 

The corridor tiling problem is formalized as follows. Let Til = (D, H,V,b,t,n) be a 
tiling system, where D = {di ,..., d m } is a finite set of tiles, H,V C D 2 are horizontal and 
vertical constraints, n is a natural number given in unary notation, b and f are tuples over 
D of length n. Intuitively, given a corridor of width to, the goal is to construct a tiling 
of the corridor using the tiles from D so that the horizontal and vertical constraints are 
satisfied and the bottom and top rows are tiled by b and t, respectively. We say that a tiling 
satisfies the horizontal constraints Irrespectively the vertical constraints V) if for every 
1 < i < n, j E N it holds that if d\ and d2 are the tiles on the positions (z, j) and (i + 1, j) 
of the corridor (respectively (i,j) and (■ i,j + 1 )), then (d\,d2) E H ((d\,d2) E V ). 

Now we construct in PTime in the length of Til, two TP^ expressions tp and if such 
that the following holds: 


<p <2 ip iff there exists a tiling for Til. 


(4.5) 


By Proposition 4.2, we may without loss of generality construct if as a disjunction of TP^ 
expressions. To this purpose, we use the string representation of a tiling. Each row of the 
considered tiling is represented by the tiles it consists of. If the tiling of a corridor of width 
to has k rows, it is represented by its rows separated by the special symbol (j. The top row 
is followed by the symbol $. Thus, a tiling is a word of the form where 
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each Ui is the word of length n corresponding to the z-th row in the tiling. In particular 
u\ = b and Uk = t. For the sake of readability, for expression r, r* denotes the path formula 
?r; J,; ?r;...; ?r with i occurrences of r. 

Let ip be 

(?6 i; |;?6 2 ;...;|;?6 n ;|;? #; |+ ; ?f 1; j;... j; ?t„; |)$. 

Intuitively, this expression enforces a tiling to start with a path starting with b and finishing 
with t and the hnal symbol $. Now the formula if dehnes all incorrect tilings and additional 
constraints. It is the disjunction of the following TP^ formulas. 


(0) (| + ; ?(-ig?i A ... A -i d m A -if)); There is a position that is labeled neither by a tile 

nor a delimiter. 

(1) Incorrect length of a row. 

(la) VITO ii + ! ?it; T*; |))t, a row is too short; 

(lb) (| + ; (~ l i)) n+ 1 )T, a row is too long; 

(2) Y( di d2 ^ H (i + ; ?di; i; ?d 2 )T, some horizontal constraints are violated; 

(^) V(d, d o ) 0 v(l + ; ?g?i; T n ; ?g? 2 )T, some vertical constraints are violated. 


Note that negated labels are used in (0) and (lb). Also note that the size of (p and z/> is 
bounded by a polynomial in the size of Til. 

We now show (4.5). 

(<=)■ Assume that there exists a tiling of the corridor. Let s be the string representation 
of it. Then, s = zzij}zz 2 ti • • • where |zq| = n, Ui £ D n , u\ = b and Uk = t. Moreover, 
on the one hand if x ■ y, is an infix of some u,i , then (x, y ) £ H , and on the other hand for 
every infix x ■ v! ■ y of length n + 1 of u$ ■ u l+ \, it holds that (x,y) £ V. Let T s be the 
corresponding tree, i.e. a single path of |s| nodes {ui,... ,f| s |} where the labeling is set in 
accordance with s, i.e. l(vi) = Si. Clearly, T s is a model of ip and not of z/>. 

(=>). Let T be a tree such that T. r j= <p and T, r ifr. Since T,r \= ip, there must exist 
a path r = v 1,..., v m in T which starts with b and finishes with i$. Moreover, either (j or a 
symbol from D is the label of every node Vi, 1 < i < m, according to (0). 

We dehne a tiling function g : {0,..., n — 1 }xN —> D assigning a tile to every position in 
the corridor as follows: g(i,j) = l(vr n+ i\ X j +i+ i), 1 < i < n, where l is the labeling function 
of T. Indeed, this function is well defined, as (1) ensures the correct counting. By formulas 
(2) and (3) the tiling defined by g satisfies the horizontal and vertical constraints. 

□ 


The difference between CTP patterns and TP patterns is that CTP descendent edges 
can be labeled by patterns. One might ask whether a bound on the degree of such a labeling 
nesting can lead to a lower complexity of the containment problem. Define the until nesting 
depth un : CTP ->Nas follows. 

• un(p) = un(T) = 0 , 

• un((i)ip) = un((l + )<p) = 1 , 

• un(ip 1 A ip 2 ) = max{un((^i), un(<y? 2 )}, 

• un((ip + )'i/j) = un(ip) + 1 . 

Unfortunately, a close examination of the encoding in the lower bound proof and the 
encodings in Propositions 4.2 and 4.4 gives a negative answer to the question. Thus we 
obtain 


Theorem 4.6. The CTP containment problem for formulas of until nesting depth one is 
PSPACE-hard. 
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4.3. Upper bounds. In this section, we show a matching PSpace upper bound for CTP 
containment. 

The complexity of CTP^ ,V containment follows from a translation into existential CTL 
(3CTL), whose containment problem is known to be PSPACE-complete [IT]. The only 
small technical issue is that 3CTL formulas are interpreted over infinite finitely branching 
trees. We solve that by relativizing formulas with a new propositional variable s, whose 
interpretation will provide the desired finite tree. 

Theorem 4.7. The containment problem for CTP^ ,V is decidable in PSpace. This also 
holds for multi-labeled trees. 


Proof. We prove the upper bound for the case of multi-labeled trees. The upper bound for 


single-labeled trees then follows by Proposition 4.3 


We recall from im the definition of 3CTL. Let Prop be a set of propositional variables. 
3CTL node formulas p and path formulas a are defined by: 


p ::= T | T | p \ -<p \ p A p \ p V p \ 3a 
a ::= ip \ a A a \ a V a \ Xa | ( aUa ) | ( aUa ) 

where p £ Prop, and asking that in node formulas, the operators A'(“next”), U( “until”) and 
[/(“dual of until”) are always immediately preceded by 3. 

The semantics for 3CTL is given by infinite finitely branching trees. 3CTL path 
formulas are interpreted at a (possibly infinite) paths of the tree and 3CTL node formulas 
are interpreted at nodes of the tree. Our reduction will translate CTP _,,V formulas to 3CTL 
node formulas, with occurrences of formulas of the form 3Xp and 3fiUp. Semantics of such 
formulas is defined as follows, given an infinite tree T = (N, E. r, p ) and a node n £ N. 


T,n \= 3 X<p iff there exists a node m £ N such that nEm and T, m |= p, 

T,n \= 3 fiUp iff there exists a node m £ N such that nE*m and T, m \= <p, 
and for all n' £ N with nE*n'E + m it holds T, n \= if. 

The CTP node formula (p + )ip corresponds to the strict until operator, which is expressible 
in 3CTL as 3X3ipUp. 

We denote by the containment relation for 3CTL node formulas over finitely 
branching trees where each branch is infinite. 

Let ip,if be in CTP^ V and let s be a new propositional variable not in tp, if. The 
translation (-) s from CTP^’ V to 3CTL node formulas relativizes every subformula with s 
and adjusts the syntax. 


(T) s 

= sAT 

C p) s 

= s Ap 

hp) s 

= sA-ij) 

(<pi a piy 

= <Pi A p s 2 

(Pl V P2) S 

= TiVT S 2 

((UtY 

= s A 3 Xp s 

((* + )pY 

= sA3X3fi s Up’ 
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We claim that p C i/; iff p s ip s . 

The proof is by contraposition. First assume that for some finite tree T = (TV. E, r, p) 
and node n G TV it holds T,n |= <p and T,n \/= ip. We then construct an infinite tree 
which is a counterexample. Let s be a propositional variable not occurring in p or ip. 
Let be the infinite tree obtained from T by adding to each leaf in T an infinite path. 
The labeling is changed as follows: all new nodes have the empty label set and we add 
the label s to the label set of all old nodes. Formally, T£, = (TV*,, E^pr, Poc), where 
Noo = TV U {n™ , n™,... | m is a leaf in T} is the set of nodes, E^ = E U {(m, n r { L ) \ n™ G 
Noo} U {(^f l ’ n i+i) I n T’ n i +1 ^ TVq 0 ,z > 1} is the set of edges, r is the root and is the 
labeling function defined as follows. 

, . ( p(n) U {s} if n G TV, 

ifn£N^\N. 

We show by induction on the structure of the formula that for every node n G TV and 
every CTP^' V formula 9, T,n\= 9 iff T^,n \= 9 s . 

To show this we will use the following two facts which are easy consequences of the 
definitions of and the translation (-) s : 

T^,n \= s n G TV, (*) 

and 

T^n \=p s => T^n^s. (**) 

• 9 = T. We have T, n |= T iff n G TV iff n G TVoo and s € Poo(n) iff T^n |= s A T iff 
^,n|=(T)L 

• 9 = p,p G E. We have T,n |= p iff p E p(n) iff (since n E TV) {p, s} C Poc pri) iff 
T^,n\= s/\ p iff T^,n \= (p) s . 

• 9 = -ip,p G £. We have T, n (= -p iff p 0 p(n) iff (since n E TV and, thus, s G Poo(n)) 
P 0 Poo(^) and s G Poo(n) iff T^, n \= s A -ip iff 7^, n |= (-p)L 

• 9 = ipi A ip 2 - We have T, n |= <pi A <p 2 iff T, n \= p\ and T, n |= <p 2 iff (by the induction 
hypothesis, (*) and (**)) T^,n (= pf and T^,n |= <p| iff T^,n |= <pf A <p| iff T^,n |= <p s . 

• 0 = p\ V <p 2 - By the same argument as for <pi A <p 2 - 

• 9 = (Dy?. We have T, n \= (4_)<p iff there exists m G TV such that nErri and T,m \= p iff 
(by the induction hypothesis, (*), (**) and the fact that n G TV) T^,n |= s and there 
exists m G TVoo such that nE^m and T^, m (= <p s iff T^, n j= s A EW(p s iff J= ((\.}p) s . 

• 9 = ( ip + )p . We have T,n |= (ip + )p iff there exists m G TV such that T, to |= <p and for 
all n! G TV with nE + n'E + m it holds T, n' |= ip. Then by the induction hypothesis, (*) 
and (**), the latter is equivalent to existence of m G TVoo such that Tp^, m |= p s and for 
all n' G TVqq with nE^n'E^m it holds \= ip s . The latter implication holds because 
for every node n' with nEp^n'Ep^m it holds n' G TV and nE + n!E + m. Equivalently, there 
exists ri\ G TV^ and m G TVoo such that nE^niEp^m, Tp^. m \= p s and for all n' with 
niEp^n'E^m. it holds Tp^ , n! (= ip s . This is equivalent to Tp^ , n \= s A 3X3ip s Up s as 
desired. 

Thus, we obtain both T^,n |= p s and T'P ^, n (A ip s . 

For the other direction, let = (TVqo, Eoo, r, poo) be some infinite tree over £ such 
that T°°,r |= 99 s and T°°,r \/= ip s . We then remove from the model all nodes without s 
and their descendants. Formally, T = (TV, E,r, p), where TV = TV^ \ {n | 3n! G TVqo .s £ 
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Poo{n') A n'E^n} is the set of nodes, E = Eoo\nxN is the set of edges, r is the root, and 
p(n) = Poo {n) is the labeling function. 

By induction we can show that for every CTP -1 ^ formula 6 and node n E N, it holds 
,n |= 6 s if and only if T,n |= 6 s . The two non-trivial cases are the following. 

• 6 = (|}</?. We have Too,n \= s A 3Xtp s iff Too,n |= s and there exists m E Noc such 
that nEoom and Too , rn (= ip s iff T,n\= s and there exists m & N such that nEm and 
T, m |= p s . The direction from right to left is obvious, while the direction from left to right 
because of the following. Since Too,m \= ip s , it follows that , m \= s, i.e. s E Pooijn). 
Then since n E N, we have that m E N too by the definition. The latter also implies that 
nEm holds in T and thus that T, n (= s A 3Xip s . 

• 6 = (ip + )<p. We have T^, n ^sA 3X3if s U<p s iff Too, n \= s and there exists m E Noo such 
that Too rn (= T s , nEf^rri and for all n! E Xoo with nT+n'T+m it holds Txj, n 1 \= ip s . 
This is equivalent to T,n\= s and there exists m E N such that T, m |= tp s and for all n' 
with nE + n'E + m it holds T, m (= ip s , which means T, n |= s A 3X3'i/j s Uip s as desired. The 
direction from left to write is trivial since T is a substructure of Too- The direction from 
right to left follows from the fact that m E N and all the nodes n! such that nE^n'E^m 
belong to N as well and, moreover, it holds nE + n'E + m. 

Thus, we obtain that T,r \= ip s and T,r \/= . Because all nodes of T now make s true, we 

can discard the relativization with s in the formulas. Thus obtaining T,r \= <p and T,r \/= ip. 
The only problem is that T is infinite. But using the simulations from Theorem |3.3| it is 
easy to see that we can turn T into a finite counterexample. 

Thus, we have shown that <p C ^ if and only if <p s ^. The latter containment 
problem in 3CTL is known to be decidable in PSpace. This concludes the proof. □ 

Corollary 4.8. The containment problem for TP^ is PSpace complete. 

However, restricting negation to a safe negation p A -k/i A ... A -i q n keeps the complexity 
of the containment problem for tree patterns in CONP. Note that p A ~>q\ A ... A ~>q n 
only adds expressive power over multi-labeled models, because on single labeled models 
p = p A -igi A ... A -'qn- The proof of the following result can be found in [Hj. Here TP V,_ ' 
is tree patterns expanded with disjunction and the safe negation construct. 

Theorem 4.9. The containment problem for TP V ’^ over multi-labeled, trees is in CONP. 


5. Conclusion 

We have shown that adding conditions on the edges of tree patterns gives a boost in expressive 
power which comes with the price of a higher - PSpace - complexity for the containment 
problem than for tree patterns. We located the source of the extra complexity in the fact 
that unrestricted negations of labels can be coded in Conditional Tree Patterns. Adding 
negations of labels to tree patterns causes an increase in complexity of the containment 
problem from CONP to PSpace. 

This paper is a first step in exploring “Regular Tree Patterns” and fragments of it. We 
mention some directions for future work. 

Miklau and Suciu in m mention that existence of a homomorphism between tree 
patterns is a necessary but not sufficient condition for containment in TP. Can we extend 
the simulations between conditional tree patterns and trees to simulations between queries, 
partly capturing containment as for tree patterns? 
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What is the complexity of containment of regular tree patterns, i.e. the positive fragment 
of Regular XPath without disjunction and union? As the satisfiability (and thus also the 
containment) problem for Regular XPath is known to be ExpTlME-complete, it must lie in 
between PSpace and ExpTime. 

There are also interesting characterization questions: what is the exact FO fragment 
corresponding to CTP or CTP with disjunction and union? Ten Cate m showed that 
full Regular XPath (with all four axis relations) expanded with path equalities is equally 
expressive as binary FO* (First-order logic extended with a transitive closure operator that 
can be applied to formulas with exactly two free variables). Is every positive forward binary 
FO* formula expressible as a Regular Tree Pattern? Is CTP with disjunction and union 
equally expressive as FO intersected with positive FO*? Are unions of CTP equivalent to 
unions of the first order fragment of conjunctive regular path queries [6j? 
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Appendix A. Translations between CTP and ctp 

The following constructions are similar to the tree and graph representations for tree patterns 
and conjunctive queries over trees mm 

We define the translation function c(-) which assigns an equivalent conditional tree 
pattern with one output node to every CTP formula. The output node of c(cp), where cp is a 
node formula, equals the root. 

Let a and p be path and node CTP formulas. We define c(a) and c(p) by mutual 
induction on the complexity of the path and node formulas. We take c(ct) or c(</?) to be the 
tree (N, E,r,o, pn, Pe), where the components are defined according to the cases. 

• a = 4 -. Then N consists of two nodes r and n. The edge relation E is defined as {(r, n)}. 
Moreover, o := n, Pn{v) = 0 for v G {r, n} and pE({r,n}) =|, 

• a =?<p. Then c(a) := c(ip), 

• a = ai; 0:2- Let c(«i) and 0(012) be the conditional tree patterns for aq and 02- Then 
c(a) is the tree obtained as follows. We fuse the root of 0(0:2) with the output node of 
c(qi) and declare the output node of 0(02) as the output node of c(a). The label of the 
fusion node is the union of the labels of the output node of 0(0:1) and the root of 0(0:2)- 
Labels of other nodes and the edges remain the same as in 0(01) and 0(0:2)- 

• a = (|; ?<£>)*; Then N consist of two nodes r and n. The edge relation E is defined as 

{(r, n)}. Moreover, o := n, pn(v) = 0 for v G {r, n} and pE({r,n}) = c(<p). 

For node formulas, the output node of the translation result is always defined as the root r. 

• ip = p. Then N consists of a single node r, E is empty, the labeling pAr(r) = {p}. 

• tp = T. Similar to the previous case, with the exception that Pn{j~) = 0 

• tp = ip x a ip2- Then c(ip) := c(<p\) © 0(1^2), he. the fusion of the conditional tree patterns 

c((^i) and c(</? 2 )- 

• p = {a)(pi. Let c(a) and c(</?i) be the corresponding conditional tree patterns. Then c (ip) 
is obtained by fusing the output node of c(a) with the root of c(<p). The labeling of the 
fusion node is defined as the union of the labels of the root of c((p) and the output node 
of c(a). Labels of the other nodes and edges remain the same as in c(a ) and c (ip). 

Translation /(•) works the other way around. Let t = (N,E,r,o, pw, Pe) be a conditional 
tree pattern with one output node. We define fit) by induction on the nesting depth and 
the depth of the tree. We first define a mapping <p from nodes v 6 N to CTP node formulas. 
The mapping is defined inductively starting from the leaves as follows. Here, for a finite set 
S = {p\,... ,p n }, by AS we denote the finite conjunction p\ A ... A p n . We take A0 to be T. 
For a conditional tree pattern t, by rq we denote the root of t. For v a node in pattern t: 

<p(v) = A p N (v ) A 

A A 

(v,v')GE, pE(v,v')=t' 

A 

{v,v')eE, p E {v,v')=i 
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Now let r = v±,..., v n = o be the path from the root r to the output node o in t. Let the 
expression di, 1 < i < n — 1 be defined by: d* if pE{vi, fi+i) =| and d % = (|; ?¥>(ry))*; | 
if pE(vi,Vi+ 1 ) = t !. Then the result of the translation f(t) of the conditional tree pattern t 
is the CTP path formula: 


?<p(pl); dr, ?9?(u 2 ); d 2 ;...; ?ip(v n -i); d n _i; ?^(u n ). 

For the next proposition we need the dehnition of equivalence between conditional 
tree patterns. Let 1 be a conditional tree pattern with output nodes o,|o|=fc, and T 
a tree. Then the answer set of t over T is the set Out(t,T ) = {(<7(01 g{ok)) \ 
g is a simulation of t in T}. We say that two conditional tree patterns t\ and f 2 are equiva¬ 
lent, denoted as t\ — t 2 , if Out(t\,T ) = Out(t- 2 ,T ) for every tree T. 

Proposition A.l. Let and a be CTP node and path formulas, t a conditional tree pattern. 
Then it holds that 

(i) /«¥>)) = <P and /( c («)) = a, 

(ii) c (f(t)) ~ t. 

The proof of (i) is by mutual induction on a and cp, and the proof of (ii) is by induction on 
nesting depth and depth of t. 
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